Revisiting 2x2 matrix optics: Complex vectors, Fermion 
combinatorics, and Lagrange invariants 
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Abstract. We propose that the height-angle ray vector 
in matrix optics should be complex, based on a geomet- 
ric algebra analysis. We also propose that the ray's 2x2 
matrix operators should be right-acting, so that the ma- 
trix product succession would go with light's left-to-right 
propagation. We express the propagation and refraction 
operators as a sum of a unit matrix and an imaginary ma- 
trix proportional to the Fermion creation or annihilation 
matrix. In this way, we reduce the products of matrix op- 
erators into sums of creation-annihilation product com- 
binations. We classify ABCD optical systems into four: 
telescopic, inverse Fourier transforming, Fourier trans- 
forming, and imaging. We show that each of these sys- 
tems have a corresponding Lagrange theorem expressed 
in partial derivatives, and that only the telescopic and 
imaging systems have Lagrange invariants. 

1 Introduction 

a. Complex Vectors. In 2 x 2 matrix optics, a ray 
is normally described by a column vector as given by 
Nussbaum and Phillips pQ: 



n<\ 



(1) 



except that we interchanged the coefficients. This equa- 
tion is problematic: a is an angle and not a distance. 
The Cartesian coordinate system is a system for locating 
a point in space in terms of distances from a fixed point 
measured along orthogonal lines. But in what space do 
angles live? In an imaginary vector space? 

Yes. To see why this is so, let us first recall the or- 
thonormality axioms in geometric algebra [2J: 

ef = e 2 k = l, (2) 
e,-e fc = -ejtej, j ^ k, (3) 



where j,k £ {1,2,3}. That is, the square of each unit 
vector is unity and that the product of two perpendicu- 
lar vectors anticommute. Notice that the multiplicative 
inverse of a unit vector is itself. 

We know from Yariv[3] that the paraxial angle a is the 
slope of the function x = x(z): 



dx 
dz 



— tana 



But if we define z = ze% and x 
of geometric algebra, we have 



(4) 

xei, then by the rules 



rfx 
dz 



dx 
dz 



e 3 ei 



dx 
dz 



ie 2 a, 



(5) 



where i = e 16263 is the unit trivector that behaves 
like the unit imaginary numberjl]. Therefore, instead 
of Eq. (TT]), we write 



x 
nai 



(6) 



which is a complex vector like the electromagnetic field 
F = E+iB[S]. Note that we adopted the convention that 
lengths like height and radius are dimensionless. (Al- 
ternatively, we may replace x by £x, where £ is a unit 
quantity with dimension of inverse length). 

b. Fermion Combinatorics. In most matrix optics 
texts, the convention is light travelling left to right. Yet 
the left-acting propagation and refraction matrix opera- 
tors used are multiplied from right to left: 



M'„M', 



N' V 'N-1 ' 



(7) 



So we propose a more logical way: define the matrix op- 
erators to be right-acting^, so that they multiply from 
left to right, in the same direction of the light's propaga- 
tion. That is, 



• • • Mjv-iM 



JV- 



(8) 
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Here, the action of the right-acting matrix is defined 
by the action of its left-acting transpose, as given in 
Symon[7]: 

M T -r = r-M. (9) 

Because the ray operators are 2x2 matrices, we 
may decompose them as a linear combination of single- 
element, unit matrices, as done by Campbell[S] and 
Harris [S]: 



M 



Mnen + M 12 e 12 + M 21 e 21 + M 22 e 22 , (10) 



where 



en 



e2i 



1 





1 



ei2 



e22 



1 





1 



(11) 

(12) 



are the Fermion matrices in Sakurai 10 and Le 
BellacpT]. The dyadics ei2 and e2i may represent either 
the creation operator or the annihilation operator a, 
depending on the column matrix representations of ei 
and e 2 - 

For the ray vector r = xe± + nae 2 , the left-acting 
propagation and refraction matrices are given in Klein 
and Furtak (transposed matrices) [12] : 



T = l + e 12 D/n, 
R = l-e 21 P, 



(13) 
(14) 



where 1 is the unit matrix. Thus, a lens system becomes 
a product of propagation and refraction matrices: 

M = T„R„---T 2 R2T 1 R 1 

n 

= l[(l + e 12 D k /n k )(l+e 21 P k ), (15) 
fe=i 

Later, we shall recast these equations for our com- 
plex ray vectors and right-acting matrices. We shall also 
present new methods for computing the system matrix 
M. These methods are based on the Fermion identies 
satisfied by the four dyadic operators en, ei2, e 2 ±, and 
^22- In particular, we shall study the allowed combina- 
tions of ei 2 Dk/n k and ei 2 P k > and determine the matrix 
component basis e k k' of the product chain. 

c. Lagrange Invariants. The Lagrange theorem 
or the Smith-Helmholtz relationship p~3l [14] is stated by 
WelfordfH] as 



nurj = n u rj 



(16) 



where n is refractive index of the input medium, the ob- 
ject height, u is the angle subtended from the object, and 
•q is the height of the object; their primed counterparts 
correspond to those of the image. The quantity nurj is 
called the Lagrange invariant. Later, using our x and na 



variables, we shall show that we may recast Lagrange's 
theorem in Eq. (|16[) in terms of partial derivatives: 



x' djn'a') 
x d(na) 



= -1; 



constants. 



(17) 



The quantity |x5(na)| is the Lagrange invariant. 

d. Outline. We shall divide the paper into five sec- 
tions. The first section is Introduction. In the second 
section, we discuss the algebra of right-acting matrices 
and their actions on column vectors. In the third section, 
we shall introduce the complex ray vector and its right- 
acting propagation and refraction matrices. We shall use 
the properties of Fermion creation-annihilation matrices 
to compute the system matrices of thin and thick lenses. 
In the fourth section, we shall revisit the classification of 
optical systems: telescopic, Fourier transforming, inverse 
Fourier transforming, and imaging. We shall derive their 
Lagrange theorems and see if we can define their corre- 
sponding Lagrange invariants. We shall also rederive the 
Moebius transform and the Newton's equation for the 
imaging system. The fifth section is Conclusions. 



2 Matrix Algebra 



Let ei and e2 be two orthonormal vectors represented as 
column matrices, 



ei = 'oj' ea= viy 



(18) 



and let en, ei2, eai, and e22 be the four Fermion matrices 
in Eqs. (JTTJ) and ([T^]) . The left and right action of the 
dyadic operators on ei and e2 are defined by the following 
relations [IB): 



^■fj,u ' *^A — 



and 



(19) 
(20) 

(21) 
(22) 



where A, fi, v G {1,2}. Notice that we can rederive these 
relations if we adopt the definitions 



-{lis 



(23) 
(24) 



with the understanding that the dot product takes prece- 
dence over the juxtaposition (geometric) product. 

Let -M be a right-acting 2x2 matrix and let M T - be 
its left-acting transpose: 



Mnen + Mi 2 ei2 + M 2 %e 21 + Af 22 e 2 2,(25) 
Mnen + Mi 2 e 2 i + M 2 iei 2 + M 22 e 22 .(26) 



2 



The action of these two matrices on the vector 



Xiei + X 2 e 2 



are related by 



(27) 



(28) 



where r' is another vector. That is, [5] 



Mn 


M 2 ^ 




Mia 


M 22 , 






fMii 


Mi 2 


s) 


vM 2i 


M 22 



Hence, 



XiMu + x 2 M 2i , 

X\M\% + X 2 M 2 2- 



(29) 



(30) 
(31) 



Notice that the column-column multiplication in the ac- 
tion of right-acting matrices is simpler than the row- 
column multiplication in that of left-acting matrices. 



3 Matrix Optics 
3.1 Ray Cliffor 

Let us define the complex height-angle vector f as 

('. 

\ noti , 



xei + nme 2 



(32) 



Here, the vector e3 as the optical axis pointing to the 
right, ei as pointing upwards, and e 2 as pointing out of 
the paper. We define the light ray to be moving from left 
to right. The height x of the ray is positive if the ray is 
above the optical axis and negative if below. The angle a 
is positive if the ray is inclined and negative if declined. 

(Fig. m 

To define the paraxial angle a more precisely, we use 
sign functions [T71 EH]- If tr is the direction of propagation 
of the light ray as it moves close to the direction of the 
optical axis e3 , then the ray's angle of inclination a with 
respect to e 3 is [19] 



Q 



where 



rr ■ ei 

|o--ei|' 

cos (er • e 3 ). 



(33) 

(34) 
(35) 



The sign function c ax is the relative direction of the <r 
along the axis ei, with +1 meaning along and —1 oppo- 
site. The angle 8 az is magnitude of the angle between a 
and e.3. 



a 2 







X2 


Xl 



■T3 



Si 



S2 



Figure 1: A paraxial ray with height x\ and incli- 
nation angle ai moves to the right by a distance si 
until it hits an refracting surface. The height of the 
ray becomes X2 and its inclination angle changes 

tO Q2- 



3.2 Matrix Operators 

When light propagates, the paraxial meridional angle a 
remains constant. So the ray tracing equations are 



x = x + sa, 
a' = a, 
z = z + s. 



(36) 
(37) 
(38) 



In most texts, the last equation is assumed. 

Using the definition of the ray cliffor f in Eq. 
Eqs. (f3"6"| and (|3~T|) may be combined as 



r' = r ■ M s = f ■ (1+S). 



where 



That is, 



S = — iSe 2 i = — i— e 



12- 



X 

nai 



1 







-is/n 1 



(39) 
(40) 

(41) 



Except for the —i, the propagation matrix is the same as 
that given by Hecht 20J . 

On the other hand, when light refracts the height x 
of the light ray remains constant. So the ray tracing 
equations are 



n a — na — Px, 
z' = z. 

Here, P is the power of the interface [21] 

n — n! 



P 



R 



(42) 
(43) 
(44) 



(45) 
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where 



V ■ e 3 
1*7 -e 3 1 



where M is a 2 x 2 matrix. That is, 



(46) 



is the sign function describing the relative direction of 
the outward normal vector r\ to the spherical interface of 
radius R, with respect to the optical axis e 3 . If c rjZ = +1, 
T) is approximately along e 3 ; if c vz = —1, 77 is opposite. 

Using the definition of the ray cliffor f in Eq. (1321) . 
Eqs. (j42| and (|43|) may be combined into 



= f-(l + P), 



(47) 



where 
That is, 



P = -iPe u 



x 



(48) 
(49) 



/ x \ (l -^P^ 
n'a'z / \nai J \0 1 

Again, except for the — i, the refraction matrix is similar 
to that given by Hecht[20 . 

We may also rewrite the propagation and refraction 
matrices by using the definition of the exponential of the 
matrix A as 



a A 2 A 3 

e A = l + A+ — + — + 



2! 3! 



so that if A =0, then 



e A = l + A. 



(50) 



(51) 



Thus, the exponential of a null-square matrix is the sum 
of the matrix itself and the unit matrix. 

Because ei2 and e2i are null-square matrices, 



0, 



(52) 



then we may use Eq. (|5Tj) to express the matrices M5 
and M P in Eqs. §§§ and gT]) as 



Mo = = e s = 1 



M, 



1 



S = 1 - iSe 2 i, 
P = 1 - iPe 12 . 



(53) 
(54) 



Except for the — i factor, these exponential forms of the 
propagation and refraction operators were used before 
by Simon and Wolf 22J. These forms are useful when 
we have a series of propagations or of refractions. For 
these cases, we only have to add the arguments of the 
exponentials, as what we shall see next. 

3.3 System Matrix 

An optical system may be considered a black box: we do 
not know what is inside. All we know is that, in general, 
the relationship between the input ray f and output ray 
f ' is given by 

f'=f-M, (55) 



\noti 



A -iC 
-iB D 



so that 



= Ax + Bna, 
= —Cx + Dna. 



(56) 



(57) 
(58) 



Notice that these equations are similar to those in the 
literature, save for the sign of C. 

In general, the system matrix M is a product of prop- 
agation and refraction operators (c.f. 23J): 



N 



N 



M =n 



k=l 



k=l 



where 



Sk = -iSke-21 = 
Pk = -iPkeu = 



k=i 



.s k 
-1 — e 2 i, 

n k 

n k - nk+i 
-ic v k ^ e i2 . 



R 



Note that the determinant of M is unity [24 , 

N 

|M| = Y[ \e Sk \\e Pk \ = 1, 
fe=i 

because its factors have a determinant of unity, 



Sfci _ uPfci _ 



= 1. 



(59) 

(60) 
(61) 

(62) 
(63) 



Let us take some special cases. If P& = for all k, 
then 

N N 

m = n M sk = n eSfc = eS = i+s > ( 64 ) 

k=l k=l 

where 



JV 



N 

S = ]TS fc = -ie 21 ^S fc . (65) 
fe=i fe=i 
In other words, the reduced distance[T3j or the index- 
normalized path length S = s/n of a sequence of vertical 
interfaces (parallel to the xy— plane) is equal to the sum 
of the index-normalized path lengths between successive 
interfaces. (Path length is normally defined as ns.) 
On the other hand, if = for all k, then 



JV 



k=l 



k=l 



M = []M ra = [|e Pi =e p = l + P, (66) 

JV 



jv 

-iei2 2jPfc. 



(67) 



where 

JV 

P = ^ r fc 

k=l k=l 

In other words, the total power P of a sequence of refract- 
ing surfaces separated by negligible distances is equal to 
the sum of the individual powers of each interface. 
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3.4 Fermion Combinatorics 

In general, we cannot simply add the arguments of the 
exponentials in Eq. (|59|) because 



(68) 



for all subscripts k and £. So instead of the product- 
of-exponentials form, we shall use the binomial product 
form and study how to expedite its expansion. 

To expand the binomial product form in Eq. (|59p . we 
note several things, which we shall label as rules: 

Rule 1. The only allowed products are those with 
alternating S and P factors, because 



SkSe 



P k Pe 



0. 



(69) 



Rule 2. Since matrix multiplication is not generally 
commutative, then the order of the factors must be pre- 
served. This means that the values of the subscript k 
must be increasing from left to right; if the subscripts 
are the same, Sk should come before P^. 

Rule 3. The matrix Sfc is an — ie 2 \ quantity; P& is a 
— iei2 quantity. Because the allowed products are those 
with alternating S and P factors, then the Fermion basis 
of the product depends only on the first and last factors: 



Sfc • • • Se, 

p k ---s e , 

P k ■ ■ ■ Pi, 



H) L e 2 i, 
H) L e 22 , 
H) L en, 
H) L ei 2 , 



(70) 
(71) 
(72) 
(73) 



where L is the number of factors in the product chain. 
Rule 4. The unit number 1 is a sum of en and e 22 , 



1 = e n 



e 22 - 



(74) 



These four rules let us compute the system matrix in a 
systematic way, by simply listing down the allowed com- 
binations of S and P. 

3.5 Thin and Thick Lenses 

To illustrate our four combinatorial rules, let us compute 
the expansions of M in Eq. (f59|) for k — 1 and k = 2. 



Case k = 1. The matrix 
M 



is 



(1 + Si)(l + Pi) 

i + (Si + p x )+ SiPi. 



In Fermion basis, this is 
M 



eu(l) + e 12 (-i)Pi 
+ e 21 H)S 1 +e 22 (l + H) 2 S 1 P 1 ). 



(75) 



(76) 



That is, 



1 



-iPi 



(77) 



Case k = 2. The matrix M is 

M = (1 + S0(l + Pi)(l + S 2 )(l + P 2 ) 
= l + (Si + Pi + S 2 + P 2 ) 

+ (SiPi + SiP 2 + S 2 P 2 + PiS 2 ) 

+ (S 1 P 1 S 2 + P 1 S 2 P 2 ) + S 1 P 1 S 2 P 2 . (78) 

In Fermion basis, this is 

M = e 11 (l + (^) 2 P 1 5 2 ) 

+ e 12 (-iPi - iP 2 + {-ifPiS 2 P 2 ) 

+ e 21 (-iSx - iS 2 + H) 3 SiPiS 2 ) 

+ e 22 (l + H) 2 ^ + {-ifS^ + H) 2 S 2 P 2 
+ H) 4 ^!^)- (79) 



That is, 



( (1-ASa) 

S\ + s 2 



V 



-S\P\S 2 



i{Pi + P 2 -PiS 2 P 2 ) \ 

/ 



1-SiPi -SxP 2 -S 2 P 2 
+ S\P\S 2 P 2 



(80) 

Check. To check our computations, we set S\ = in 
Eq. (80]) to get 



'(1 - PiS 2 ) -i(Pi + P 2 - Pi5 2 P 2 ) N 

'^1::. : = | ) , (81) 

-iS 2 (1 - S 2 P 2 ) 

Notice that except for the —i, Eq. (f5Tj) is similar in form 
to the system matrix for a thick lens as given in Klein and 
Furtak (they use left-acting matrices and angle-height 
vectors) P5]. 

Furthermore, if we set S 2 = in Eq. I|81[). then we 
obtain the corresponding thin lens matrix 26 : 



M 



thin 



l -i(Pi + p 2 ) 
o 1 



(82) 



Comparison of Eq. (|82p with the refraction matrix oper- 
ator Mp in Eq. (14T))) yields the thin lens power 



P = Pi 



(83) 



which is what we expect. That is, the total power of a 
thin lens is equal to the sum of the powers of its refracting 
surfaces. 
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4 Optical Systems 



Second, the elements of the system matrix are given by 



So far, we have considered the input and output rays to 
be very close to the optical black box. We shall now relax 
this restriction. 

Let the matrix Mbox describe a black box: 



M 



box 



Mn 
-iM 2 i 



-iM 2 2 
M 22 



which satisfies Eq. ([62 



(84) 



(85) 



Notice that unlike the determinant of standard system 
matrices with real coefficients, the determinant of Mbox 
in Eq. ([55]) is not a difference but a sum. 




A = Mn - M 12 S', (88) 

B = M 2 i+ M 22 S' + M n S - M 12 SS', (89) 

C = Mi 2 , (90) 

D = M 22 -M 12 S. (91) 

Now, the input and output rays are related to the sys- 
tem matrix M by Eqs. (|5T|) and ([5"5j) . Our aim is to 
use these equations to give a geometric interpretation to 
the Mbox parameters Mn, M12, M21, and M22- To do 
this, we shall choose from the set {x, x', a, a'} a pair of 
variables and set the other variables constant. We shall 
consider four cases: 



(92) 
(93) 
(94) 
(95) 



x' 


= x'(x)\ 


a 


a' 


= constants 


x' 


= x'{a); 


X, 


a' 


= constants 


a' 


= oc'(x); 


a 


x' 


= constants 


a' 


= a' (a); 


X, 


x' 


= constants. 



4.1 Telescopic System 



Figure 2: An optical system with a black box. 
The input side is at the distance s to the left of 
the box; the output side is at a distance of s' to 
the right of the box. 




To the left of the box at the reduced distance S = s jn 
is the input ray characterized by height x and optical 
angle na. To the right of the box at an reduced distance 
S' = s'/n' is the output ray characterized by height x' 
and dilated angle n'a' . In other words, the system matrix 
M in Eq. (5BJ) is 



A -iC 

-iB D 

1 

-iS 1 



M s M box M y 



M n 
-iM 21 



-iM 12 
M 22 



-iS' 1 



(86) 



Except for the —i, this matrix product is similar in form 
to that of Simmons and Guttmann|27j. (In Ghatak[2"5] 
and in Nussbaum and Phillips [55] the matrix coefficients 
of Mbox are labeled by —a, b, c, and —d.) 

From Eq. (|86[) we can arrive at two conclusions. First, 
because the sytem matrix is a product of matrices with 
unit determinants, then 



|M S | = AB + CD = 1. 



(87) 



Figure 3: In a telescopic system, parallel rays 
emerge as parallel rays. 



If x' = x'(x), with the the angles a and a' constants, 
then the input and output beams are both bundles of 
parallel rays — a telescopic or afocal system. If this is 
true, then we may differentiate Eqs. (|57|) and (|58p with 
respect to x to obtain[3ni 131] 



dx' 

— =A =M n -M 12 S', 
ox 



= C = M 



12- 



(96) 
(97) 



Using these equations, together with Eqs. (|57|) to ([58 
and ([55) to ([9"T]) . we arrive at 



dx 



Mn, 
M 22 . 



(98) 
(99) 



G 



Thus, in a telescopic system, the ratio of the outgoing and 
ingoing rays's index-dilated angles is the angular magni- 
fication M22PI2]; the ratio of the change in the output 
and input heights is M\\. 

If we multiply Eqs. ([98]) and ((99j) , we get 



and (l88l) to (19T1). we arrive at 



dx 



'boa; 



(100) 



because Mi 2 = 0. Equation (|100[) is the Lagrange's the- 
orem for a telescopic system. An alternative formulation 
of this theorem is 



n'a' 5x' = na Sx, 



(101) 



where naSx is the corresponding Lagrange invariant. 
That is, if the angles of the input and output rays are 
positive constants, then the change in height x of the 
input ray is proportional to the change in height of the 
output ray x' . 



4.2 Inverse Fourier Transforming 
System 




Figure 4: In an inverse Fourier transforming sys- 
tem, rays moving out from a point source become 
parallel rays. 



s = 


M 22 _ 




M X2 " 


n'a' 


-M X2 , 




X 




dx' 


|M box | 


d(na) 


Mia 



(104) 
(105) 
(106) 



Thus, in an inverse Fourier transforming system, a point 
light source at the input focal distance / = M 22 jM\ 2 
to the left of the black box transforms to a bundle of 
parallel rays at the output [33] : the ratio of the optical 
inclination angle n'a' of the output beam to the height 
x of the point source is —Mi 2; the change in the height 
x' of the output ray with respect to the change in the 
dilated angle na of the input ray is I/M12. 
If we multiply Eqs. ([Tut]) and pUB]) . we get 



n'a' dx' 
x d{na) 



-1. 



(107) 



Equation (jTUTJ) is the Lagrange's theorem for an inverse 
Fourier transforming system. In terms of differentials, 
Eq. (|107[) may be written as 



-x S(na) 



(108) 



Notice that though we could not define a corresponding 
Lagrange invariant for this system, we can still provide 
a geometrical interpretation to Eq. (|108[) : if the input 
height x and the output angle a' are positive constants, 
then a positive change in the angle a of the input ray 
would result to a negative change in the output height x' 
of the output ray. 

4.3 Fourier Transforming System 



If x' = x'(a), with x and a constants, then the system 
transforms a point source into a beam of paralell rays — 
an inverse Fourier transforming system. If this is true, 
then we may differentiate Eqs. ([57]) and ([55]) with respect 
to the index-dilated angle na to obtain[5ni l3Tj 

dx' 

— — =B= M 21 +M 22 S' 
d(na) 

+ M U S - M 12 SS', (102) 
=D= M 22 -M 12 S. (103) 

Using these equations, together with Eqs. ([ST]) to ([55)1 




Figure 5: In a Fourier transforming system, par- 
allel rays converge to a point. Note that the actual 
angle 8a' is the angle opposite to the one labeled 
in the figure. 



If a' — a'(x), with x' and a constants, then the system 
focuses a beam of parallel rays into a point — a Fourier 
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transforming system. This means that we may differenti- 
ate Eqs. ([57)1 and (|58|) with respect to x to obtam [3"0"Il3"T] 





d(n'a') 
dx 



A = Mn - M 12 S', (109) 
-C = -M 12 . (110) 



Using these equations, together with Eqs. 
and to (f9"Tj) . we arrive at 



53 to (E8 



S' 


M n _ 






M 12 " 


x' 


|M box | 


1 


na 


Ml2 


M12 



(111) 

(112) 



Notice that Eqs. (fTTTj) , (TIT!?]) , and (flTU)) are the conjugate 
relations for Eqs. (|104|) to p06|) . That is, in a Fourier 
transforming system, a bundle of parallel rays at the in- 
put is focused to a point source at the output at the focal 
distance /' = M11/M12 to the right of the black box; the 
ratio of the height x of the focus to the the dilated angle 
na of the input beam is — M12; the change in the dilated 
angle n'a' at the output focus with respect to the change 
in the height x of the input beam is I/M12. 
If we multiply Eqs. (fTTOf and (fm)) . we get 



x' d(n'a') 
na dx 



(113) 



Equation (|113p is the Lagrange's theorem for a Fourier 
transforming system. In terms of differentials, Eq. (1113[) 
may be written as 



x' 5(n'a') — —na5x. 



(114) 



Notice that though we could not also define a correspond- 
ing Lagrange invariant for this system, we could also still 
interpret Eq. (|114[) geometrically: if the input ray angle 
a and the output height x' are positive constants, then 
a positive change in the input ray height x would result 
to a negative change in the output ray angle a' . 



Using these equations, together with Eqs. ([57|) to (|58|) 
and USD) to (GEEl), we arrive atl34ll35l 



5" 
S 



M n S + M 2 i 
M 12 S-M 22 
M 22 S' + M 21 
M 12 S' - Mn ' 



and 



x 

X 



Mn - M12S" 



-1 



M 12 S - Mo 



(117) 
(118) 

(119) 



where we used the identity | Mboa: = 1- Equation (|1 1T[) 
arc the Moebius relations for the object and image 
distances. Equation (|119j) is the definition for lateral 
magnification [36] . 

There are two relations that we can derive from these 
equations: 

First, the product of Eqs. (fTTBT) and pT9)) is 



x' d(n'a') 
x d{na) 



-1, 



(120) 



which is the Lagrange's theorem for the imaging system. 
In terms of differentials, Eq. (|120[) may be written as 



x' 5{n'a') = —x5(na) 



(121) 



Notice that because of the presence of the negative sign, 
the quantity x5(na) is not a Lagrange invariant; rather, 
it is its magnitude |a;<5(rta)|. The geometrical interpreta- 
tion of Eq. (|12ip is as follows: if the input and output 
ray heights are positive constants, then a positive change 
in the input angle a would result to a negative change 
in the output angle a'. (Note: the paraxial angle a is 
measured from the optical axis e3 pointing to the right. 
A positive a is a counterclockwise rotation; a negative a 
is clockwise. Thus, the actual 5a' in Fig. [5] is the angle 
opposite to the one labeled in the figure, i.e., to the right 
of the image focus.) 



4.4 Imaging System 

If a' = a' (a), with x and x' constants, then the system 
transforms light from a point source to another point 
source — an imaging system. If this is true, then we may 
differentiate Eqs. (|57|) and (|58|) with respect to the input 
optical angle na to obtain [30] f3Tj 

=B= M 21 +M 22 S' 

+ M n S - M 12 SS', (115) 

d(n'a') 

W = D = M 22 -M 12 S. (116) 




Figure 6: In an imaging system, rays leaving a 
point source converge to a point. Note that the 
actual angle 5a' is the angle opposite to the one 
labeled in the figure. 
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Second, we may rewrite Eq. (|1 1 9[) as 



5 Conclusions 



(Mi 2 S - M 22 )(M 12 S' - Mn) - |M box | = 1. (122) 



In terms of the focal lengths / in Eq. (|104[1 and /' in 
Eq. fTTTJ) . Eq. (TT2"2"j) becomes 



(S - ^(S 1 - f) = ZZ' 



1 



/Vf2 ■ 



(123) 



This is similar to the Newton's lens equation[38 

ZZ' = FF'. (124) 

Note that the focal lengths / and /' are measured from 
the left-most and right-most side of the optical black box, 
while F and F' are measured from the left and right 
gaussian planes. 



4.5 Summary 

The system matrix M is defined as the ABCD matrix 
operating on input ray vector f to yield the output ray 
vector f' 



in' a' ) \ina 



A -iC 
-%B D 



(125) 



where 



A = 
B = 



dx' 
dx ' 
dx' 



C=-°m, (126) 



D = 



dx 
d(n'a') 



(127) 



d(na) ' d(na) 
Because the system matrix has a unit determinant, then 
dx'd(n'a') d(n'a') dx' 



1 



dx d(na) dx d(na) ' 



(128) 



as given by Goodman. [37] 

The four Lagrange theorems for the four optical sys- 
tem types may be combined into one: 



1 



n'a! dx' 
na dx 
n'a' c 



x d(na) ' 
x' d{n'a') _ 
na dx 
x' djn'a') 
x d(na) 



a, a' = const, 
x, a = const, 
a, x' = const, 
x, x' — const. (129) 



We shall refer to Eq. (|129p as the unified Lagrange the- 
orem for optical systems. 



We used the orthonormality axiom in geometric alge- 
bra to show that the height-angle vector of a light ray 
must be complex. We proposed that the matrix oper- 
ators to this vector should be right-acting, in order to 
follow the sequence of surfaces traversed by a light ray 
as it moves from left-to-right close to the optical axis. 
We showed that the propagation and refraction matrix 
operators may be expressed as a sum of a unit matrix 
and a imaginary non-diagonal Fermion matrix. We de- 
veloped combinatorial rules for finding the product of a 
succession of propagation and refraction matrices, with- 
out doing explicit matrix multiplication. This product is 
the right-acting system matrix with real and imaginary 
coefficients. 

We factored out the system matrix as a product of the 
input propagation matrix, the black box matrix, and the 
output propagation matrix. Based on the coefficients of 
the system matrix, we classified the optical systems into 
four: telescopic, inverse Fourier transforming, Fourier 
transforming, and imaging. We showed that all four sys- 
tems have a corresponding Lagrange theorem expressed 
in partial derivatives, and that these theorems may be 
combined into one. We transformed these theorems in 
terms of Lagrange differentials, which allows us to ge- 
ometrically interpret the effect of the change in input 
variable to the change in the output variable. We showed 
that these differential relations result to the Lagrange in- 
variants only for the telescopic and imaging systems. 

In a future work, we shall revisit paraxial skew ray 
tracing using complex vectors and right-acting matrices. 
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